Modular Resultant Algorithm for Graphics Processors

نویسنده

  • Pavel Emeliyanenko
چکیده

In this paper we report on the recent progress in computing bivariate polynomial resultants on Graphics Processing Units (GPU). Given two polynomials in Z[x, y], our algorithm first maps the polynomials to a prime field. Then, each modular image is processed individually. The GPU evaluates the polynomials at a number of points and computes univariate modular resultants in parallel. The remaining “combine” stage of the algorithm is executed sequentially on the host machine. Porting this stage to the graphics hardware is an object of ongoing research. Our algorithm is based on an efficient modular arithmetic from [1]. With the theory of displacement structure we have been able to parallelize the resultant algorithm up to a very fine scale suitable for realization on the GPU. Our benchmarks show a substantial speed-up over a host-based resultant algorithm [2] from CGAL (www.cgal.org).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computing resultants on Graphics Processing Units: Towards GPU-accelerated computer algebra

In this article we report on our experience in computing resultants of bivariate polynomials on Graphics Processing Units (GPU). Following the outline of Collins’ modular approach [6], our algorithm starts by mapping the input polynomials to a finite field for sufficiently many primes m. Next, the GPU algorithm evaluates the polynomials at a number of fixed points x ∈ Zm, and computes a set of ...

متن کامل

An Efficient Parallel Spatial Subdivision Algorithm for Object-Based Parallel Ray Tracing

Parallel ray tracing of complex scenes on multicomputers requires distribution of both computations and scene data to the processors. This is carried out during preprocessing and usually consumes too much time and memory. In this paper, we present an efficient parallel subdivision algorithm to decompose a given scene into rectangular regions adaptively and map the resultant regions to the node ...

متن کامل

Accelerating Fully Homomorphic Encryption on GPUs

In a major breakthrough, in 2009 Gentry introduced the first plausible construction of a fully homomorphic encryption (FHE) scheme. FHE allows the evaluation of arbitrary functions directly on encrypted data on untwisted servers. In 2010, Gentry and Halevi presented the first FHE implementation on an IBM x3500 server. However, this implementation remains impractical due to the high latency of e...

متن کامل

Efficient parallel spatial subdivision algorithm for object-based parallel ray tracing

Parallel ray tracing of complex scenes on multicomputers requires distribution of both computations and scene data to the processors. This is carried out during preprocessing and usually consumes too much time and memory. In this paper, we present an efficient parallel subdivision algorithm to decompose a given scene into rectangular regions adaptively and map the resultant regions to the node ...

متن کامل

Spatial Scan Statistics on the GPGPU

Kulldorff’s spatial scan statistic and the software implementation (SaTScan) are widely used for the detection and evaluation of geographic clusters, particularly within the health care community. Unfortunately, the computational time of the scan statistic depends on a wide variety of variables, and, depending on the chosen parameter settings and operations, the computational time can be on the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010